Enhanced Fuzzy C-Means Clustering for Improved Data Analysis with Application to the Higgs Boson Dataset
Author Details
Journal Details
Published
Published: 11 November 2025 | Article Type : Research ArticleAbstract
Fuzzy C-Means (FCM) clustering is a widely used algorithm for data partitioning, particularly when clusters exhibit overlap. However, its performance can be significantly impacted by sensitivity to initial conditions and susceptibility to noise. This paper proposes an Enhanced Fuzzy C-Means (EFCM) algorithm designed to mitigate these limitations. EFCM integrates a density-based initialization strategy using kernel density estimation, employs the robust Mahalanobis distance metric to handle noise and outliers, and incorporates Silhouette index-based adaptive parameter selection. The algorithm’s effectiveness is evaluated on the challenging Higgs Boson dataset from Kaggle, a high-dimensional and noisy dataset commonly used in high-energy physics research. Results demonstrate EFCM’s superior performance compared to traditional FCM in terms of clustering accuracy, execution time, and precision.
Keywords: Enhanced Fuzzy C-Means, Clustering, Fuzzy Clustering, Initialization, Noise Handling, Parameter Selection, Silhouette Index, Mahalanobis Distance, Higgs Boson Dataset, Kernel Density Estimation.
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
Copyright © Author(s) retain the copyright of this article.
Statistics
18 Views
37 Downloads
Volume & Issue
Article Type
Research Article
How to Cite
Citation:
Naga Charan Nandigama. (2025-11-11). "Enhanced Fuzzy C-Means Clustering for Improved Data Analysis with Application to the Higgs Boson Dataset." *Volume 7*, 2, 1-5